Search CORE

HAL Evry

EzArray: A web-based highly automated Affymetrix expression array data management and analysis system

Author: A Brazma
BM Bolstad
C Li
C Romualdi
CM Kendziorski
D Rajagopalan
E Hubbell
GK Smyth
H Rehrauer
HM Hsueh
J Rainer
JM Vaquerizas
JM Wettenhall
K Hokamp
L Jones
M Kapushesky
M Psarros
MA Newton
O Larsson
R Diaz-Uriarte
R Edgar
R Ihaka
RA Irizarry
RA Irizarry
S Dudoit
S Vardhanabhuti
S Zhang
VG Tusher
Wei Xu
WK Lim
WM Liu
X Xia
Y Barash
Yuelin Zhu
Yuerong Zhu
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background Though microarray experiments are very popular in life science research, managing and analyzing microarray data are still challenging tasks for many biologists. Most microarray programs require users to have sophisticated knowledge of mathematics, statistics and computer skills for usage. With accumulating microarray data deposited in public databases, easy-to-use programs to re-analyze previously published microarray data are in high demand. Results EzArray is a web-based Affymetrix expression array data management and analysis system for researchers who need to organize microarray data efficiently and get data analyzed instantly. EzArray organizes microarray data into projects that can be analyzed online with predefined or custom procedures. EzArray performs data preprocessing and detection of differentially expressed genes with statistical methods. All analysis procedures are optimized and highly automated so that even novice users with limited pre-knowledge of microarray data analysis can complete initial analysis quickly. Since all input files, analysis parameters, and executed scripts can be downloaded, EzArray provides maximum reproducibility for each analysis. In addition, EzArray integrates with Gene Expression Omnibus (GEO) and allows instantaneous re-analysis of published array data. Conclusion EzArray is a novel Affymetrix expression array data analysis and sharing system. EzArray provides easy-to-use tools for re-analyzing published microarray data and will help both novice and experienced users perform initial analysis of their microarray data from the location of data storage. We believe EzArray will be a useful system for facilities with microarray services and laboratories with multiple members involved in microarray data analysis. EzArray is freely available from <url>http://www.ezarray.com/</url>.</p

Springer - Publisher Connector

A Reporter Screen in a Human Haploid Cell Line Identifies CYLD as a Constitutive Inhibitor of NF-κB

Author: A Jolma
A Kovalenko
A Sawada
Alexander Poltorak
Clarissa C. Lee
CP Guimaraes
DM Rosmarin
E Trompouki
F Abascal
GA Maston
Hidde L. Ploegh
Jan E. Carette
JE Carette
JE Carette
JE Carette
JM Vaquerizas
K Tominaga
L Tornatore
LM Duncan
LT Jae
M Kotecki
MS Hayden
N Hövelmeyer
P Papatheodorou
RG Baker
S Vallabhapurapu
SC Sun
T Hayashi
Thijn R. Brummelkamp
TR Brummelkamp
W Jin
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013
Field of study

The development of forward genetic screens in human haploid cells has the potential to transform our understanding of the genetic basis of cellular processes unique to man. So far, this approach has been limited mostly to the identification of genes that mediate cell death in response to a lethal agent, likely due to the ease with which this phenotype can be observed. Here, we perform the first reporter screen in the near-haploid KBM7 cell line to identify constitutive inhibitors of NF-κB. CYLD was the only currently known negative regulator of NF-κB to be identified, thus uniquely distinguishing this gene. Also identified were three genes with no previous known connection to NF-κB. Our results demonstrate that reporter screens in haploid human cells can be applied to investigate the many complex signaling pathways that converge upon transcription factors

DSpace@MIT

Maastricht University Research Portal

Sex-specific associations between particulate matter exposure and gene expression in independent discovery and validation cohorts of middle-aged men and women

BACKGROUND: Particulate matter (PM) exposure leads to premature death, mainly due to respiratory and cardiovascular diseases. OBJECTIVES: Identification of transcriptomic biomarkers of air pollution exposure and effect in a healthy adult population. METHODS: Microarray analyses were performed in 98 healthy volunteers (48 men, 50 women). The expression of eight sex-specific candidate biomarker genes (significantly associated with PM(10) in the discovery cohort and with a reported link to air pollution-related disease) was measured with qPCR in an independent validation cohort (75 men, 94 women). Pathway analysis was performed using Gene Set Enrichment Analysis. Average daily PM(2.5) and PM(10) exposures over 2-years were estimated for each participant’s residential address using spatiotemporal interpolation in combination with a dispersion model. RESULTS: Average long-term PM(10) was 25.9 (± 5.4) and 23.7 (± 2.3) μg/m(3) in the discovery and validation cohorts, respectively. In discovery analysis, associations between PM(10) and the expression of individual genes differed by sex. In the validation cohort, long-term PM(10) was associated with the expression of DNAJB5 and EAPP in men and ARHGAP4 (p = 0.053) in women. AKAP6 and LIMK1 were significantly associated with PM(10) in women, although associations differed in direction between the discovery and validation cohorts. Expression of the eight candidate genes in the discovery cohort differentiated between validation cohort participants with high versus low PM(10) exposure (area under the receiver operating curve = 0.92; 95% CI: 0.85, 1.00; p = 0.0002 in men, 0.86; 95% CI: 0.76, 0.96; p = 0.004 in women). CONCLUSIONS: Expression of the sex-specific candidate genes identified in the discovery population predicted PM(10) exposure in an independent cohort of adults from the same area. Confirmation in other populations may further support this as a new approach for exposure assessment, and may contribute to the discovery of molecular mechanisms for PM-induced health effects. CITATION: Vrijens K, Winckelmans E, Tsamou M, Baeyens W, De Boever P, Jennen D, de Kok TM, Den Hond E, Lefebvre W, Plusquin M, Reynders H, Schoeters G, Van Larebeke N, Vanpoucke C, Kleinjans J, Nawrot TS. 2017. Sex-specific associations between particulate matter exposure and gene expression in independent discovery and validation cohorts of middle-aged men and women. Environ Health Perspect 125:660–669; http://dx.doi.org/10.1289/EHP37

Institutional Repository Universiteit Antwerpen

University of Southern Denmark Research Output

Genome Expression Pathway Analysis Tool – Analysis and visualization of microarray gene expression data under genomic, proteomic and metabolic context

Author: A Rosenwald
AA Alizadeh
AI Saeed
B Mlecnik
B Zhang
BM Bolstad
C von Mering
F Al-Shahrour
Gene Ontology Consortium
GJ Dennis
GK Smyth
J Rainer
JM Vaquerizas
Julia C Engelmann
Jörg Schultz
M Kanehisa
M Kapushesky
M Kotera
M Masseroli
M Pelizzola
Markus Weniger
O Troyanskaya
P Khatri
P Lichter
P Shannon
R Gentleman
R Shamir
S Bea
SW Doniger
TJP Hubbard
W Huber
YH Yang
Publication venue: BioMed Central
Publication date: 01/06/2007
Field of study

Abstract Background Regulation of gene expression is relevant to many areas of biology and medicine, in the study of treatments, diseases, and developmental stages. Microarrays can be used to measure the expression level of thousands of mRNAs at the same time, allowing insight into or comparison of different cellular conditions. The data derived out of microarray experiments is highly dimensional and often noisy, and interpretation of the results can get intricate. Although programs for the statistical analysis of microarray data exist, most of them lack an integration of analysis results and biological interpretation. Results We have developed GEPAT, Genome Expression Pathway Analysis Tool, offering an analysis of gene expression data under genomic, proteomic and metabolic context. We provide an integration of statistical methods for data import and data analysis together with a biological interpretation for subsets of probes or single probes on the chip. GEPAT imports various types of oligonucleotide and cDNA array data formats. Different normalization methods can be applied to the data, afterwards data annotation is performed. After import, GEPAT offers various statistical data analysis methods, as hierarchical, k-means and PCA clustering, a linear model based t-test or chromosomal profile comparison. The results of the analysis can be interpreted by enrichment of biological terms, pathway analysis or interaction networks. Different biological databases are included, to give various information for each probe on the chip. GEPAT offers no linear work flow, but allows the usage of any subset of probes and samples as a start for a new data analysis. GEPAT relies on established data analysis packages, offers a modular approach for an easy extension, and can be run on a computer grid to allow a large number of users. It is freely available under the LGPL open source license for academic and commercial users at <url>http://gepat.sourceforge.net</url>. Conclusion GEPAT is a modular, scalable and professional-grade software integrating analysis and interpretation of microarray gene expression data. An installation available for academic users can be found at <url>http://gepat.bioapps.biozentrum.uni-wuerzburg.de</url>.</p

University of Regensburg Publication Server

The Characterisation of Three Types of Genes that Overlie Copy Number Variable Regions

Author: A Barski
A Gimelbrant
A Krek
A Necsulea
Alex Bateman
AM Andrés
Arkady B. Khodursky
B Schuster-Böckler
BE Stranger
C Cheng
Cara Woodwark
CM Carvalho
CN Henrichsen
D Betel
DF Conrad
DP Bartel
E Allemand
E Eisenberg
FM Pauler
GL Papadopoulos
IM Morison
J Li
J Zhu
JM Vaquerizas
K Chen
KL Wright
L Patthy
M Megraw
M Ruault
MJ Moore
R Redon
RC Friedman
S Haider
S Ohno
S Vasudevan
TJ Hubbard
W Huang da
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Background: Due to the increased accuracy of Copy Number Variable region (CNV) break point mapping, it is now possible to say with a reasonable degree of confidence whether a gene (i) falls entirely within a CNV; (ii) overlaps the CNV or (iii) actually contains the CNV. We classify these as type I, II and III CNV genes respectively. Principal Findings: Here we show that although type I genes vary in copy number along with the CNV, most of these type I genes have the same expression levels as wild type copy numbers of the gene. These genes must, therefore, be under homeostatic dosage compensation control. Looking into possible mechanisms for the regulation of gene expression we found that type I genes have a significant paucity of genes regulated by miRNAs and are not significantly enriched for monoallelically expressed genes. Type III genes, on the other hand, have a significant excess of genes regulated by miRNAs and are enriched for genes that are monoallelically expressed. Significance: Many diseases and genomic disorders are associated with CNVs so a better understanding of the different ways genes are associated with normal CNVs will help focus on candidate genes in genome wide association studies

Unveiling transcription factor regulation and differential co-expression genes in Duchenne muscular dystrophy

Author: AR Burr
B Cai
B Wong
Baoshan Huang
C Cerella
C Dogra
Chuanling Zhang
DB Davis
DJ Blake
EP Hoffman
F Altamirano
GK Smyth
HB An
J Yang
JM Vaquerizas
JN Haslett
JR Gorospe
Junhua Cao
K Bushby
Lijun Tian
M Pescatori
MC Monici
MJ Spencer
RA Irizarry
RM McDouall
SI Head
Tong Qian
V Cheriyath
Xianxiang Song
Xingqiang Deng
Y Benjamini
YW Chen
YW Chen
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

The Annotation, Mapping, Expression and Network (AMEN) suite of tools for molecular systems biology

Abstract Background High-throughput genome biological experiments yield large and multifaceted datasets that require flexible and user-friendly analysis tools to facilitate their interpretation by life scientists. Many solutions currently exist, but they are often limited to specific steps in the complex process of data management and analysis and some require extensive informatics skills to be installed and run efficiently. Results We developed the Annotation, Mapping, Expression and Network (AMEN) software as a stand-alone, unified suite of tools that enables biological and medical researchers with basic bioinformatics training to manage and explore genome annotation, chromosomal mapping, protein-protein interaction, expression profiling and proteomics data. The current version provides modules for (i) uploading and pre-processing data from microarray expression profiling experiments, (ii) detecting groups of significantly co-expressed genes, and (iii) searching for enrichment of functional annotations within those groups. Moreover, the user interface is designed to simultaneously visualize several types of data such as protein-protein interaction networks in conjunction with expression profiles and cellular co-localization patterns. We have successfully applied the program to interpret expression profiling data from budding yeast, rodents and human. Conclusion AMEN is an innovative solution for molecular systems biological data analysis freely available under the GNU license. The program is available via a website at the Sourceforge portal which includes a user guide with concrete examples, links to external databases and helpful comments to implement additional functionalities. We emphasize that AMEN will continue to be developed and maintained by our laboratory because it has proven to be extremely useful for our genome biological research program.</p

Springer - Publisher Connector

HAL-Inserm

Institutional Repository of the Freie Universität Berlin

HAL-Rennes 1

A systematic, large-scale comparison of transcription factor binding site models

Background The modelling of gene regulation is a major challenge in biomedical research. This process is dominated by transcription factors (TFs) and mutations in their binding sites (TFBSs) may cause the misregulation of genes, eventually leading to disease. The consequences of DNA variants on TF binding are modelled in silico using binding matrices, but it remains unclear whether these are capable of accurately representing in vivo binding. In this study, we present a systematic comparison of binding models for 82 human TFs from three freely available sources: JASPAR matrices, HT-SELEX-generated models and matrices derived from protein binding microarrays (PBMs). We determined their ability to detect experimentally verified “real” in vivo TFBSs derived from ENCODE ChIP-seq data. As negative controls we chose random downstream exonic sequences, which are unlikely to harbour TFBS. All models were assessed by receiver operating characteristics (ROC) analysis. Results While the area- under-curve was low for most of the tested models with only 47 % reaching a score of 0.7 or higher, we noticed strong differences between the various position-specific scoring matrices with JASPAR and HT-SELEX models showing higher success rates than PBM-derived models. In addition, we found that while TFBS sequences showed a higher degree of conservation than randomly chosen sequences, there was a high variability between individual TFBSs. Conclusions Our results show that only few of the matrix-based models used to predict potential TFBS are able to reliably detect experimentally confirmed TFBS. We compiled our findings in a freely accessible web application called ePOSSUM (http:/mutationtaster.charite.de/ePOSSUM/) which uses a Bayes classifier to assess the impact of genetic alterations on TF binding in user-defined sequences. Additionally, ePOSSUM provides information on the reliability of the prediction using our test set of experimentally confirmed binding sites

Springer - Publisher Connector

EBP1 Is a Novel E2F Target Gene Regulated by Transforming Growth Factor-β

Author: A Iavarone
A Lacerte
A Rabinovich
A Sacco
AW Hamburger
B Sun
C Gerard
C-R Chen
David Judah
DL Burkhart
HJ de Jonge
IA Ivanova
JL Lavrrar
JM Vaquerizas
JY Ahn
L Dagnino
Lina Dagnino
MD Apostolova
ME McLaughlin-Drubin
Mikhail V. Blagosklonny
N Radomski
N Zheng
P Chomczynski
PJ Farnham
S Emmrich
SF Dobrowolski
SK Nordeen
TC Hallstrom
TP Monie
VA Swiss
W Zhu
Wing Y. Chang
WY Chang
WY Chang
X Xia
Y Lu
Y Tao
Y Zhang
Y Zhang
Z Kherrouche
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Regulation of gene expression requires transcription factor binding to specific DNA elements, and a large body of work has focused on the identification of such sequences. However, it is becoming increasingly clear that eukaryotic transcription factors can exhibit widespread, nonfunctional binding to genomic DNA sites. Conversely, some of these proteins, such as E2F, can also modulate gene expression by binding to non-consensus elements. E2F comprises a family of transcription factors that play key roles in a wide variety of cellular functions, including survival, differentiation, activation during tissue regeneration, metabolism, and proliferation. E2F factors bind to the Erb3-binding protein 1 (EBP1) promoter in live cells. We now show that E2F binding to the EBP1 promoter occurs through two tandem DNA elements that do not conform to typical consensus E2F motifs. Exogenously expressed E2F1 activates EBP1 reporters lacking one, but not both sites, suggesting a degree of redundancy under certain conditions. E2F1 increases the levels of endogenous EBP1 mRNA in breast carcinoma and other transformed cell lines. In contrast, in non-transformed primary epidermal keratinocytes, E2F, together with the retinoblastoma family of proteins, appears to be involved in decreasing EBP1 mRNA abundance in response to growth inhibition by transforming growth factor-β1. Thus, E2F is likely a central coordinator of multiple responses that culminate in regulation of EBP1 gene expression, and which may vary depending on cell type and context

Scholarship@Western